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Abstract 

Consider a list of n files whose popularities are random. These files are updated according to the move-to- 
front rule and we consider the induced Markov chain at equilibrium. We give the exact limiting distribution of 
the search-cost per item as n tends to infinity. Some examples are supplied. 
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1 Introduction and model 



Consider a list of n files which is updated as follows: at each unit of discrete time, a file is requested independently 
of the previous requests and is moved to the front of the list. This heuristic is called the move-to-front rule and was 
first introduced by [14] and [10] to sort files. Such strategy is used when the request probabilities are unknown, 
otherwise we would list the files in order to have decreasing request probabilities. The move-to-front rule induces a 
Markov chain over the permutations of n elements which has a unique stationary distribution, (see [3] and reference 
to the work of Hendricks, Dies and Letac therein). This distribution turns out to be the size-biased permutation of 
the request probabilities. 

Here, we consider that these request probabilities are themselves random, as in a Bayesian analysis. Let uu = 
(wj)j e iv» be a sequence of iid positive random variables. The Laplace transform of a weight will be denoted by 
4> and its expectation by [i. For any i £ JV*, Wj represents the weight of the file i. We can construct request 
probabilities p = (pi, ..,p n ) as follows: 



Such random vector p is called a random discrete distribution [8]. 

Let us denote by S n the search cost of an item (i.e. the position in the list of the requested item) when the underlying 
Markov chain is in steady state (the first position will be 0). For this model, [2] obtained exact and asymptotic 
formulae for the Laplace transform of S n (some results were also extended to the case of independent random 
weights). In particular, they found the limit of the expectation and the variance of S n . Moreover, in the case of 
i.i.d. gamma weights, [1], obtained the exact and asymptotic distribution of S n , using an exact representation of the 
size-biased permutation arising from Dirichlet partitions. Note that [5] found the limiting distribution of S n when 
weights are deterministic but non-identical, in some cases (uniform, Zipf 's law, generalized Zipf 's law, power law 
and geometric). 

In section 2, we shall give a general formula for the density of the limiting search cost distribution S, provided that 
the expected weight is finite. Then we derive the moment function and the cumulative distribution function of S. 
We also discuss the relationship between the move-to-front rule and the least-recently-used strategy. In section 3 
we study some examples for which computations can be done explicitly: both continuous and discrete distributions 
are considered. 

2 Limiting search cost distribution 

The early analysis of the heuristic move-to-front focused on the expected search cost, see [10], [8] and [7], for 
instance. Later, researchers paid much attention to the (transient and stationary) distribution of the search cost 
([6]). Some of them investigated the limiting behavior as the number n of items tends to infinity (see [5]). In a 
more recent article, [2] obtained an integral representation of the Laplace transform of S n in the Bayesian model 
described in the introduction. Their main theorem is the following: 



n 



Vz G {1, ...,n} , 
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Theorem 2.1 For a sequence to ofiid positive random variables, 

VOO, <j> Sn {s)=n / ( f>"(r)[(f>(r)+e- s (4>(r-t)-0(r))] n - 1 drdt. 
Jo Jt 

In the same article the integral representation for the two first moments of S n were derived. Moreover, they 
obtained a point-wise asymptotic equivalent for the Laplace transform of S n and the limit of the first two moments 
of S n /n when the number n of items tends to infinity. From theorem 2.1, we can obtain the following closed-form 
expression for the density function of the limiting distribution of Sn/n: 

Theorem 2.2 For a sequence u> ofiid positive random weights with finite expectation /i, 

S n d „ 

> S , 

fl n—>oo 

where S is a continuous random variable with the following density function fs-' 

IS{X) = V (0-1(1 - a) ) V-Po](*) • (D 
where po = P(u>i = 0) and (f)^ 1 is the inverse function of '</>. 

Remark 2.1 The quantity p n can be interpreted as follows: p is the probability that an item is never requested. At 
stationarity, one expects that any such item will be at the bottom of the list: np is the mean number of unrequested 
items. So it is not surprising that the support of S is not the entire unit interval. Note that if the distribution of the 
weight is continuous, then p = 0. 

Proof We have to prove that S n /n converges in distribution, as n tends to infinity, to a certain random variable 
that will be denote by S. First, observe that: 

VO0, cj> Sn/n {s) = S „ (£) • 

So we are now interested in the limit of 4>s n (s/n). 
For any reals a and b such that < a ^ b ^ oo, let: 

( h r / "1 n—l 

I n (a, b) - / 4>"{r) 4>(r) + e- s / n {4>(r - t) - 4>{r)) dr . 

J a 

If b = oo, then we will omit this parameter, i.e. I n (a) = i n (a, oo). Using these notations, theorem 2.1 gives: 

(-OO 

4sAi)= n I n (t)dt. (2) 
Jo 

We now decompose I n (t) into two parts: I n (t) — I n (t, t + e)+I n (t + e). We will prove that nl n (t + e, oo) tends 
to when n tends to infinity: 

n-l 



>t+e 

< n 



rOO 

nl n (t + £,oo) = n 4>"{r) e- s ' n {4>{r-t) + {l-e- s / n )^{r)) 

Jt+e L 
/•oo 

, / 4>"(r)4>{r -t) n - x dr , 

Jt+s 



dr 



>t+e 

< -ruf>(e) n - V(* + e) , 
since <p is decreasing. Then lim n _ >00 nl n (t + e, oo) = 0, for all e > 0. 
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Now we will estimate I n {t, t + e). Let h n (r, t) = <j)(r) + e s / n {(f){r — t) — 4>{ r ))- For a fixed value of t, the 

dh n 
dr 



function h n (-, t) behaves as 4>. In particular is an increasing function for r e [t, t + e\. Then we obtain the 



following bounds: 



and: 

4>"(t + e)^4>"(r) <^"(t). 

Hence, we can bound J n (i, t + e) by: 

+ - j t %''(r)(h n {r,t)) n - 1 ^(r,t)^(r^ dr 

< ^"(^(t 1 t)- 1 ^(^(i+M)r-(^(t,t)n ■ 

Proceeding similarly, we can find a lower bound: 

J„(i, t + e) > 0"(t + e)^(t + e, ty 1 ^- [{h n {t + e, t)) n - (h n (t, t)) n ] . 

Then, for any s > 0, one can prove the following limits hold: 

lim (h n (t + e,t)) n = 0, 

lim {h n {t,t)) n = exp [-s(l -</>(*))] , 

n^oo 

lim ^(M) = 
lim ^(t + e,t) = 0'(O). 

n^oo or 

Replacing these limits in the equations above, we have computed upper and lower bounds of I n (t, t + e). In other 
words, if the limit of nl n (t, t + e) exists, then it is bounded by: 

- ri jl £) exp (-(1 - <P(t)» < I™ nl n (t, t + e) ^ -^f\ cxp (-(1 - 0(t))a) . 

This is true for any e > 0; then letting e tends to 0, we have: 

lim nl n (t) = exp (-(1 - <f>(t))s) . 

Replacing this limit in equation (2) we obtain 

lim <f> Sn /n(s) = ~ / 4>"{t)e-^-^ s dt , (3) 

which will be denoted by <ps (s). Although this limit a priori is not necessarily the Laplace transform of a random 
variable, according to the Continuity theorem (page 431 Ch. XIII in [4]), one has to check that lim s ^o (f>s{s) = 1, 
which can be proved by using the dominated convergence theorem. 

A suitable change of variable y = 1 — </>(r) in equation (3) gives: 

li J <j)'((t> l (l~y)) 
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where for the integral limits we used the property that <p(oo) = po (see [4] remark in theorem 1(a) page 439 Ch. 
XIII). Therefore, we have that: 

fs{y) = ^^'^ 

is the probability density of S. □ 

As a corollary to this theorem, we can compute the q-th moment and the cumulative distribution function (c.d.f.) 
of S: 

Corollary 2.1 For any q G R 

i r°° 

E[S«] = - / (l-0(t))V'(*)dt, 
M Jo 

and, for any x € [0, 1], 

/ 1 /^"'(l-z) \ 

P{S^x)=l-J 4>"{t)dt \ l [0 ,i_ Po] (a;) + ]l ( i_ POi i ] (a:). 



One could be interested in the cumulative distribution function of S (or more precisely in the survival function), 
since the move-to-front rule is related to the least-recently-used strategy (see [7] for instance). Indeed, many 
operating systems or softwares use a memory (also called cache) that could be quickly addressed (think of a web 
browser, for instance). Hence, one needs to define a strategy to organize it. Let us consider that the cache is made 
of k files. The least-recently-used strategy is the following: at each unit of discrete time, a file is requested and is 
moved in front of the cache; if the file was not just previously in the cache, then the last file is deleted from the 
cache and all other files are shifted by one position to the right; if the file was just previously in the cache, then the 
file is moved exactly as in the move-to-front rule. So, the move-to-front rule can be viewed as a special case of the 
least-recently-used strategy for which the length of the cache is equal to the number of files (k = n). An important 
question arises: what is the probability that the requested file is not in the cache? The probability of this event 
is called the page default; we will denote it by Hk in the sequel. Because of the link between the move-to-front 
rule and the least-recently-used strategy (as underlined above), we clearly have that 7Tfc = P(S n ^ k). So, if we 
assume that the cache length is proportional to the number of files, say k — an with a <G [0, 1] fixed, for a large 
collection of files, the following approximation holds: 



b"(t)dt 

A* JO 

if a < po and ir an ~ 1 otherwise. 



1 n 

M Jo 



3 Examples 

In this section, we study some examples for which we are able to do explicitly all computations. We will consider 
both continuous and discrete distribution for the random weights. 

Example 3.1 Suppose that the weights have the Dirac distribution at point mass 1 (in other words, weights are 
deterministic and are equally requested). Then 4>{r) = e~ r , the expectation u = 1 and p = 0, we deduce that: 

f Sl (x) = l[o,i](z) • 
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Thus, Si has the uniform distribution over [0,1]: this result was already proved in (theorem 4.2, p. 198 of [5]). The 
fc-th moment (with k E R + ) and the c.d.f. of Si is: 

= ^-j-y and Vx e [0, 1] , F Sl (x) = P(Si x) = x . 

Example 3.2 Suppose that the weights have the Gamma distribution with parameter a > 0. In this example, 
the random vector (pi, . . . ,p n ) has the symmetric Dirichlet distribution D n (a) (see [15] or [9]). In such a case, 
Po = 0, fi = a and <f>(r) = (1 + r)~ a . Computations give: 

/s 2 M=(l + £) (l-.x) 1 /«l [0 , 1] (x), 

which is the density function of the Beta distribution with parameters (1, 1 + 1/a). Note that this result has already 
been proved by [1] with a specific technique using properties of Dirichlet distribution (in this case we were able 
not only to find the limiting search cost distribution but also the transient search cost distribution for any finite n). 
The k-th moment (with k E R + ) of S2 is: 

_ r(fc + i)r(2 + i) 
E[Sa] ~ r ( 2 + fc + i) • 

In particular, we have EfSy = 2 a+\ an ^ VarfSy = (sa+r)(2a+i) 2 ■ ® ne can a ^ so com P ute the c -d.f of S2 and, for 
any x E [0, 1], we get: 

F S2 (x)=P(S 2 ^x) = l-(l-x) 1+1 / a . 

We can easily deduce that, for any x E [0,1], Fs 2 (x) ^ Fs 1 (x), where Fs 1 (-) = 1 — Fs 1 (-). So we have 
S2 dist Si (where < st denotes the usual stochastic ordering; see [12] or [13], for instance). 

Example 3.3 Suppose that the weights have the Geometric distribution on IV with parameter p E (0, 1). In such 
case, po = p, fi = (1 — p)/p and (f>(r) = p/(l — (1 — p)e~ r ). Elementary computations give: 

fs 3 (x) = Y~p ^[o.i-pIW • 



The k-th moment (with k E IR+) of S3 is: 

fc _ (2+pk)(l-p) k 
^ (fc + l)(fc + 2) ' 

In particular, we have E[S 3 ] = (2+p) 6 (1 ~ p) and Var[5 3 ] = ^-p^+^p-p 3 ) , One can also compute the c.d.f of S 3 
and, for any x E [0, 1], get: 

F S3 (x) =P(S 3 < x) = X( " 2 ~ P p X) l[o,i- P ]W + l(i- P ,i](*) • 
Hence, from the above expression, one can check that S4 < st Si. 

Example 3.4 Suppose that the weights have the Poisson distribution with parameter A. In such case, p = e~ x , 
li = A and <j>(r) = cxp (Ae~ r — 1). Simple computations give: 

. , ln(l - x) + A + 1 •« 

JSA X ) = ^ ^[0,l-e-^]W • 

Using formula 1.6.5.3 of [1 1] (page 244), one can compute the fc-th moment (with k E N) of 54: 



A+(l-e- A ) fc+1 -J2 
1=1 



(i-e-*y 
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In particular, we have E[Si] = \ — — — . One can also compute the c.d.f of S4 and, for any x G [0, 1], we get: 

F Si (x) = P(S 4 < x) = {x - ^(1 - x) ln(l -x))t [o,i_e-*] (a:) + 1 ( i_ e -\i] (z) . 
Thus, from the expression above, one can deduce that S4 ;< st Si. 

From the study of these four examples, one can observe that both S2, S3 and S4 are stochastically smaller than 
Si. Hence, the following conjecture looks appealing: 

Conjecture 3.1 Let S be the limiting distribution of the search cost associated to a sequence uj of iid positive 
random variables. Then, S < st S\ where Si is a random distribution having the uniform distribution on the unit 
interval. 

This conjecture is compatible with some remarks in [2], more precisely with proposition 3.1 therein. Indeed, if 
the conjecture is right, then as a consequence we have E[S] ^ E[S\] = \. And this is precisely what is stated 
in proposition 3.1. This conjecture can be interpreted as follows: the case with Dirac weights corresponds to the 
worst case. Despite our conjecture seems to be true, its proof seems to be difficult. 
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